Quantum mechanics |
---|
Uncertainty principle
|
Introduction Glossary · History |
Background
|
Fundamental concepts
|
Formulations
|
Equations
|
Advanced topics
|
Scientists
Bell · Bohm · Bohr · Born · Bose
de Broglie · Dirac · Ehrenfest Everett · Feynman · Heisenberg Jordan · Kramers · von Neumann Pauli · Planck · Schrödinger Sommerfeld · Wien · Wigner |
In quantum mechanics, the Heisenberg uncertainty principle states a fundamental limit on the accuracy with which certain pairs of physical properties of a particle, such as position and momentum, can be simultaneously known. In layman's terms, the more precisely one property is measured, the less precisely the other can be controlled, determined, or known.
In his Nobel Laureate speech, Max Born said:
“ | ...To measure space coordinates and instants of time, rigid measuring rods and clocks are required. On the other hand, to measure momenta and energies, devices are necessary with movable parts to absorb the impact of the test object and to indicate the size of its momentum. Paying regard to the fact that quantum mechanics is competent for dealing with the interaction of object and apparatus, it is seen that no arrangement is possible that will fulfill both requirements simultaneously...[1] | ” |
Published by Werner Heisenberg in 1927, the uncertainty principle was a key discovery in the early development of quantum theory. It implies that it is impossible to simultaneously measure the present position while also determining the future motion of a particle, or of any system small enough to require quantum mechanical treatment.[2] Intuitively, the principle can be understood by considering a typical measurement of a particle. It is impossible to determine both momentum and position by means of the same measurement, as indicated by Born above. Assume that its initial momentum has been accurately calculated by measuring its mass, the force applied to it, and the length of time it was subjected to that force. Then to measure its position after it is no longer being accelerated would require another measurement to be done by scattering light or other particles off of it. But each such interaction will alter its momentum by an unknown and indeterminable increment, degrading our knowledge of its momentum while augmenting our knowledge of its position. So Heisenberg argues that every measurement destroys part of our knowledge of the system that was obtained by previous measurements.[3] The uncertainty principle states a fundamental property of quantum systems, and is not a statement about the observational success of current technology.[2]
The principle states specifically that the product of the uncertainties in position and momentum is always equal to or greater than one half of the reduced Planck constant ħ, which is defined as the re-scaling h/(2π) of the Planck constant h. Mathematically, the uncertainty relation between position and momentum arises because the expressions of the wavefunction in the two corresponding bases are Fourier transforms of one another (i.e., position and momentum are conjugate variables). In the mathematical formulation of quantum mechanics, any non-commuting operators are subject to similar uncertainty limits.
Contents |
Werner Heisenberg formulated the Uncertainty Principle at Niels Bohr's institute in Copenhagen, while working on the mathematical foundations of quantum mechanics.
In 1925, following pioneering work with Hendrik Kramers, Heisenberg developed matrix mechanics, which replaced the ad-hoc old quantum theory with modern quantum mechanics. The central assumption was that the classical concept of motion does not fit at the quantum level, and that electrons in an atom do not travel on sharply defined orbits. Rather, the motion is smeared out in a strange way: the Fourier transform of time only involve those frequencies that could be seen in quantum jumps.
Heisenberg's paper did not admit any unobservable quantities like the exact position of the electron in an orbit at any time; he only allowed the theorist to talk about the Fourier components of the motion. Since the Fourier components were not defined at the classical frequencies, they could not be used to construct an exact trajectory, so that the formalism could not answer certain overly precise questions about where the electron was or how fast it was going.
The most striking property of Heisenberg's infinite matrices for position and momentum is that they do not commute. Heisenberg's canonical commutation relation indicates by how much:
and this result did not have a clear physical interpretation in the beginning.
In March 1926, working in Bohr's institute, Heisenberg realized that the non-commutativity implies the uncertainty principle. This implication provided a clear physical interpretation for the non-commutativity, and it laid the foundation for what became known as the Copenhagen interpretation of quantum mechanics. Heisenberg showed that the commutation relation implies an uncertainty, or in Bohr's language a complementarity.[4] Any two variables that do not commute cannot be measured simultaneously — the more precisely one is known, the less precisely the other can be known. Heisenberg wrote:
It can be expressed in its simplest form as follows: One can never know with perfect accuracy both of those two important factors which determine the movement of one of the smallest particles—its position and its velocity. It is impossible to determine accurately both the position and the direction and speed of a particle at the same instant.[5]
One way to understand the complementarity between position and momentum is by wave-particle duality. If a particle described by a plane wave passes through a narrow slit in a wall like a water-wave passing through a narrow channel, the particle diffracts and its wave comes out in a range of angles. The narrower the slit, the wider the diffracted wave and the greater the uncertainty in momentum afterward. The laws of diffraction require that the spread in angle is about , where is the slit width and is the wavelength. Reasoning based on the de Broglie relation shows that the size of the slit and the range in momentum of the diffracted wave are related by Heisenberg's rule:
In his celebrated 1927 paper, "Über den anschaulichen Inhalt der quantentheoretischen Kinematik und Mechanik" ("On the Perceptual Content of Quantum Theoretical Kinematics and Mechanics"), Heisenberg established this expression as the minimum amount of unavoidable momentum disturbance caused by any position measurement,[6] but he did not give a precise definition for the uncertainties Δx and Δp. Instead, he gave some plausible estimates in each case separately. In his Chicago lecture[7] he refined his principle:
Kennard[8] in 1927 first proved the modern inequality:
where ħ = h/2π, and σx, σp are the standard deviations of position and momentum. Heisenberg himself only proved relation (2) for the special case of Gaussian states.[7] However, it should be noted that σx and Δx are not the same quantities. σx and σp as defined in Kennard are obtained by making repeated measurements of position on an ensemble of systems and by making repeated measurements of momentum on an ensemble of systems and calculating the standard deviation of those measurements. The Kennard expression, therefore, says nothing about the simultaneous measurement of position and momentum.
A rigorous proof of a new inequality for simultaneous measurements in the spirit of Heisenberg and Bohr has been given recently. The measurement process is as follows: Whenever a particle is localized in a finite interval Δx > 0, then the standard deviation of its momentum satisfies
while the equal sign is given for Cosine states.[9]
Throughout the main body of his original 1927 paper, written in German, Heisenberg used the word "Unbestimmtheit" ("indeterminacy") to describe the basic theoretical principle. Only in the endnote did he switch to the word "Unsicherheit" ("uncertainty"). However, when the English-language version of Heisenberg's textbook, The Physical Principles of the Quantum Theory, was published in 1930, the translation "uncertainty" was used, and it became the more commonly used term in the English language thereafter.[10]
One way in which Heisenberg originally argued for the uncertainty principle is by using an imaginary microscope as a measuring device.[7] He imagines an experimenter trying to measure the position and momentum of an electron by shooting a photon at it.
Problem 1 - If the photon has a short wavelength, and therefore a large momentum, the position can be measured accurately. But the photon scatters in a random direction, transferring a large and uncertain amount of momentum to the electron. If the photon has a long wavelength and low momentum, the collision does not disturb the electron's momentum very much, but the scattering will reveal its position only vaguely.
Problem 2 - If a large aperture is used for the microscope, the electron's location can be well resolved (see Rayleigh criterion); but by the principle of conservation of momentum, the transverse momentum of the incoming photon and hence the new momentum of the electron resolves poorly. If a small aperture is used, the accuracy of both resolutions is the other way around.
The combination of these trade-offs imply that no matter what photon wavelength and aperture size are used, the product of the uncertainty in measured position and measured momentum is greater than or equal to a lower limit, which is (up to a small numerical factor) equal to Planck's constant.[11] Heisenberg did not care to formulate the uncertainty principle as an exact limit, and preferred to use it as a heuristic quantitative statement, correct up to small numerical factors.
The Copenhagen interpretation of quantum mechanics and Heisenberg's Uncertainty Principle were in fact seen as twin targets by detractors who believed in an underlying determinism and realism. According to the Copenhagen interpretation of quantum mechanics, there is no fundamental reality that the quantum state describes, just a prescription for calculating experimental results. There is no way to say what the state of a system fundamentally is, only what the result of observations might be.
Albert Einstein believed that randomness is a reflection of our ignorance of some fundamental property of reality, while Niels Bohr believed that the probability distributions are fundamental and irreducible, and depend on which measurements we choose to perform. Einstein and Bohr debated the uncertainty principle for many years.
The first of Einstein's thought experiments challenging the uncertainty principle went as follows:
Bohr's response was that the wall is quantum mechanical as well, and that to measure the recoil to accuracy the momentum of the wall must be known to this accuracy before the particle passes through. This introduces an uncertainty in the position of the wall and therefore the position of the slit equal to , and if the wall's momentum is known precisely enough to measure the recoil, the slit's position is uncertain enough to disallow a position measurement.
A similar analysis with particles diffracting through multiple slits is given by Richard Feynman.[12]
Another of Einstein's thought experiments (Einstein's box) was designed to challenge the time/energy uncertainty principle. It is very similar to the slit experiment in space, except here the narrow window the particle passes through is in time:
Bohr spent a day considering this setup, but eventually realized that if the energy of the box is precisely known, the time at which the shutter opens is uncertain. If the case, scale, and box are in a gravitational field then, in some cases, it is the uncertainty of the position of the clock in the gravitational field that alters the ticking rate. This can introduce the right amount of uncertainty. This was ironic as it was Einstein himself who first discovered gravity's effect on time.
Bohr was compelled to modify his understanding of the uncertainty principle after another thought experiment by Einstein. In 1935, Einstein, Podolsky and Rosen (see EPR paradox) published an analysis of widely separated entangled particles. Measuring one particle, Einstein realized, would alter the probability distribution of the other, yet here the other particle could not possibly be disturbed. This example led Bohr to revise his understanding of the principle, concluding that the uncertainty was not caused by a direct interaction.[13]
But Einstein came to much more far-reaching conclusions from the same thought experiment. He believed the "natural basic assumption" that a complete description of reality would have to predict the results of experiments from "locally changing deterministic quantities", and therefore would have to include more information than the maximum possible allowed by the uncertainty principle.
In 1964 John Bell showed that this assumption can be falsified, since it would imply a certain inequality between the probabilities of different experiments. Experimental results confirm the predictions of quantum mechanics, ruling out Einstein's basic assumption that led him to the suggestion of his hidden variables. (Ironically this fact is one of the best pieces of evidence supporting Karl Popper's philosophy of invalidation of a theory by falsification-experiments. That is to say, here Einstein's "basic assumption" became falsified by experiments based on Bell's inequalities. For the objections of Karl Popper against the Heisenberg inequality itself, see below.)
While it is possible to assume that quantum mechanical predictions are due to nonlocal hidden variables, and in fact David Bohm invented such a formulation, this resolution is not satisfactory to the vast majority of physicists. The question of whether a random outcome is predetermined by a nonlocal theory can be philosophical, and it can be potentially intractable. If the hidden variables are not constrained, they could just be a list of random digits that are used to produce the measurement outcomes. To make it sensible, the assumption of nonlocal hidden variables is sometimes augmented by a second assumption — that the size of the observable universe puts a limit on the computations that these variables can do. A nonlocal theory of this sort predicts that a quantum computer encounters fundamental obstacles when it tries to factor numbers of approximately 10,000 digits or more; an achievable task in quantum mechanics.[14]
Karl Popper approached the problem of indeterminacy as a logician and metaphysical realist.[15][16] He disagreed with the application of the uncertainty relations to individual particles rather than to ensembles of identically prepared particles, referring to them as "statistical scatter relations".[15][17] In this statistical interpretation, a particular measurement may be made to arbitrary precision without invalidating the quantum theory. This directly contrasts the Copenhagen interpretation of quantum mechanics, which is non-deterministic but lacks local hidden variables.
In 1934, Popper published Zur Kritik der Ungenauigkeitsrelationen (Critique of the Uncertainty Relations) in Naturwissenschaften,[18] and in the same year Logik der Forschung (translated and updated by the author as The Logic of Scientific Discovery in 1959), outlining his arguments for the statistical interpretation. In 1982, he further developed his theory in Quantum theory and the schism in Physics, writing:
[Heisenberg's] formulae are, beyond all doubt, derivable statistical formulae of the quantum theory. But they have been habitually misinterpreted by those quantum theorists who said that these formulae can be interpreted as determining some upper limit to the precision of our measurements.[original emphasis][19]
Popper proposed an experiment to falsify the uncertainty relations, though he later withdrew his initial version after discussions with Weizsäcker, Heisenberg, and Einstein; this experiment may have influenced the formulation of the EPR experiment.[15][20] A version of this experiment was realized in 1999.[16]
The many-worlds interpretation originally outlined by Hugh Everett III in 1957 is partly meant to reconcile the differences between the Einstein and Bohr's views by replacing Bohr's wave function collapse with an ensemble of deterministic and independent universes whose distribution is governed by wave functions and the Schrödinger equation. Thus, uncertainty in the many-worlds interpretation follows from each observer within any universe having no knowledge of what goes on in the other universes.
According to the de Broglie hypothesis, every object in our Universe is a wave, a situation which gives rise to this phenomenon. Consider the measurement of the position of a particle. The particle's wave packet has non-zero amplitude, meaning that the position is uncertain – it could be almost anywhere along the wave packet. To obtain an accurate reading of position, this wave packet must be 'compressed' as much as possible, meaning it must be made up of increasing numbers of sine waves added together. The momentum of the particle is proportional to the wavenumber of one of these waves, but it could be any of them. So a more precise position measurement – by adding together more waves – means that the momentum measurement becomes less precise (and vice versa).
The only kind of wave with a definite position is concentrated at one point, and such a wave has an indefinite wavelength (and therefore an indefinite momentum). Conversely, the only kind of wave with a definite wavelength is an infinite regular periodic oscillation over all space, which has no definite position. So in quantum mechanics, there can be no states that describe a particle with both a definite position and a definite momentum. The more precise the position, the less precise the momentum.
A mathematical statement of the principle is that every quantum state has the property that the root mean square (RMS) deviation of the position from its mean (the standard deviation of the x-distribution):
times the RMS deviation of the momentum from its mean (the standard deviation of p):
can never be smaller than a fixed fraction of Planck's constant:
The uncertainty principle can be restated in terms of other measurement processes, which involves collapse of the wavefunction. When the position is initially localized by preparation, the wavefunction collapses to a narrow bump in an interval Δx > 0, and the momentum wavefunction becomes spread out. The particle's momentum is left uncertain by an amount inversely proportional to the accuracy of the position measurement:
If the initial preparation in Δx is understood as an observation or disturbance of the particles then this means that the uncertainty principle is related to the observer effect. However, this is not true in the case of the measurement process corresponding to the former inequality but only for the latter inequality.
The Heisenberg uncertainty relation and its more formal versions deal explicitly with the quantum operators for position, and for momentum, . Uncertainty relations for generalized operators have also been derived, such as the Robertson[21] uncertainty relation and for arbitrary Hermitian operators and is given by
A further generalization of the Robertson relation was derived by Schrödinger[22] to give
Since the Robertson and Schrödinger relations are for general operators, then they can be used to obtain uncertainty relations for any two observables that do not commute. Examples include:
Other than the position-momentum uncertainty relation, the most important uncertainty relation is that between energy and time. The energy-time uncertainty relation is not, however, an obvious consequence of the general Robertson–Schrödinger relation. Since energy bears the same relation to time as momentum does to space in special relativity, it was clear to many early founders, Niels Bohr among them, that the following relation should hold:[6][7]
but it was not always obvious what precisely meant. The problem is that the time at which the particle has a given state is not an operator belonging to the particle, it is a parameter describing the evolution of the system. As Lev Landau once joked "To violate the time-energy uncertainty relation all I have to do is measure the energy very precisely and then look at my watch!" [25]
Nevertheless, Einstein and Bohr understood the heuristic meaning of the principle. A state that only exists for a short time cannot have a definite energy. To have a definite energy, the frequency of the state must accurately be defined, and this requires the state to hang around for many cycles, the reciprocal of the required accuracy.
For example, in spectroscopy, excited states have a finite lifetime. By the time-energy uncertainty principle, they do not have a definite energy, and each time they decay the energy they release is slightly different. The average energy of the outgoing photon has a peak at the theoretical energy of the state, but the distribution has a finite width called the natural linewidth. Fast-decaying states have a broad linewidth, while slow decaying states have a narrow linewidth.
The broad linewidth of fast decaying states makes it difficult to accurately measure the energy of the state, and researchers have even used microwave cavities to slow down the decay-rate, to get sharper peaks.[26] The same linewidth effect also makes it difficult to measure the rest mass of fast decaying particles in particle physics. The faster the particle decays, the less certain is its mass.
One false formulation of the energy-time uncertainty principle says that measuring the energy of a quantum system to an accuracy requires a time interval . This formulation is similar to the one alluded to in Landau's joke, and was explicitly invalidated by Y. Aharonov and D. Bohm in 1961 . The time in the uncertainty relation is the time during which the system exists unperturbed, not the time during which the experimental equipment is turned on, whereas the position in the other version of the principle refers to where the particle has some probability to be and not where the observer might look.
Another common misconception is that the energy-time uncertainty principle says that the conservation of energy can be temporarily violated – energy can be "borrowed" from the Universe as long as it is "returned" within a short amount of time.[27] Although this agrees with the spirit of relativistic quantum mechanics, it is based on the false axiom that the energy of the Universe is an exactly known parameter at all times. More accurately, when events transpire at shorter time intervals, there is a greater uncertainty in the energy of these events. Therefore it is not that the conservation of energy is violated when quantum field theory uses temporary electron-positron pairs in its calculations, but that the energy of quantum systems is not known with enough precision to limit their behavior to a single, simple history. Thus the influence of all histories must be incorporated into quantum calculations, including those with much greater or much less energy than the mean of the measured/calculated energy distribution.
In 1932 Dirac offered a precise definition and derivation of the time-energy uncertainty relation in a relativistic quantum theory of "events".[28] But a better-known, more widely used formulation of the time-energy uncertainty principle was given in 1945 by L. I. Mandelshtam and I. E. Tamm, as follows.[29] For a quantum system in a non-stationary state and an observable represented by a self-adjoint operator , the following formula holds:
where is the standard deviation of the energy operator in the state , stands for the standard deviation of . Although, the second factor in the left-hand side has dimension of time, it is different from the time parameter that enters Schrödinger equation. It is a lifetime of the state with respect to the observable . In other words, this is the time after which the expectation value changes appreciably.
While formulating the many-worlds interpretation of quantum mechanics in 1957, Hugh Everett III discovered a much stronger formulation of the uncertainty principle.[30] In the inequality of standard deviations, some states, like the wavefunction
have a large standard deviation of position, but are actually a superposition of a small number of very narrow bumps. In this case, the momentum uncertainty is much larger than the standard deviation inequality would suggest. A better inequality uses the Shannon entropy of the distribution, a measure of the uncertainty in a random variable described by a probability distribution
where n is an arbitrary base for the logarithm. The interpretation of Hx is that, for example if n=2, it is the number of bits of information an observer acquires when the value of x is given to accuracy is equal to . The second part is just the number of bits past the decimal point, the first part is a logarithmic measure of the width of the distribution. For a uniform distribution of width the information content is bits. This quantity can be negative, which means that the distribution is narrower than one unit, so that learning the first few bits past the decimal point gives no information since they are not uncertain.
Taking the logarithm of Heisenberg's formulation of uncertainty:
but the lower bound is not precise.
Everett (and Hirschman[31]) conjectured that for all quantum states (using natural units):
This was proven by Beckner [32] and by Iwo Bialynicki-Birula and Jerzy Mycielski[33] in 1975. The equality is attained when is a normalized Gaussian.[34]
In the context of harmonic analysis, a branch of mathematics, the uncertainty principle implies that one cannot at the same time localize the value of a function and its Fourier transform. To wit, the following inequality holds:
Other purely mathematical formulations of uncertainty exist between a function f and its Fourier transform – see Fourier transform#Uncertainty principle. A variety of such results can be found in (Havin & Jöricke 1994) or (Folland & Sitaram 1997); for a short survey, see (Sitaram 2001).
In the context of signal processing, particularly time–frequency analysis, uncertainty principles are referred to as the Gabor limit, after Dennis Gabor, or sometimes the Heisenberg–Gabor limit. The basic result, which follows from Benedicks's theorem, below, is that a function cannot be both time limited and band limited (a function and its Fourier transform cannot both have bounded domain) – see bandlimited versus timelimited. Stated alternatively, "one cannot simultaneously localize a signal (function) in both the time domain (f) and frequency domain (Fourier transform)". When applied to filters, the result is that one cannot achieve high temporal resolution and frequency resolution at the same time; a concrete example are the resolution issues of the short-time Fourier transform – if one uses a wide window, one achieves good frequency resolution at the cost of temporal resolution, while a narrow window has the opposite trade-off.
Alternative theorems give more precise quantitative results, and in time–frequency analysis, rather than interpreting the (1-dimensional) time and frequency domains separately, one instead interprets the limit as a lower limit on the support of a function in the (2-dimensional) time–frequency plane. In practice the Gabor limit limits the simultaneous time–frequency resolution one can achieve without interference; it is possible to achieve higher resolution, but at the cost of different components of the signal interfering with each other.
Amrein-Berthier (Amrein & Berthier 1977) and Benedicks's theorem (Benedicks 1985) intuitively says that the set of points where f is non-zero and the set of points where is nonzero cannot both be small. Specifically, it is impossible for a function f in L2(R) and its Fourier transform to both be supported on sets of finite Lebesgue measure. A more quantitative version is due to Nazarov (Nazarov 1994) and (Jaming 2007):
One expects that the factor may be replaced by which is only known if either or is convex.
The mathematician G. H. Hardy (Hardy 1933) formulated the following uncertainty principle: it is not possible for f and to both be "very rapidly decreasing." Specifically, if f is in L2(R), is such that
and
then, if while if then there is a polynomial of degree such that
This was later improved as follows: if is such that
then
where is a polynomial of degree and is a real positive definite matrix.
This result was stated in Beurling's complete works without proof and proved in Hörmander (Hörmander 1991) (the case ) and Bonami–Demange–Jaming (Bonami, Demange & Jaming 2003) for the general case. Note that Hörmander–Beurling's version implies the case in Hardy's Theorem while the version by Bonami–Demange–Jaming covers the full strength of Hardy's Theorem.
A full description of the case as well as the following extension to Schwarz class distributions appears in Demange (Demange 2010):
Theorem. If a tempered distribution is such that
and
then
for some convenient polynomial and real positive definite matrix of type .